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Abstract*  A particular  solution  of  the  optimum  filter  design  is  presented 
for  the  class  of  problems  where  the  noise  is  normally  distributed 
and  the  data  is  described  by  a normal  distribution.  This  is 
an  application  of  the  general  results  obtained  in  previous 
memos.1*"  This  particular  problem  lends  itself  to  an  analytic 
solution  and  in  fact  turns  out  to  yield  a linear  filter.  This 
linear  filter  has  long  been  used  in  tracking  problems  but  it 
is  believed  that  this  Is  the  first  rigourous  derivation.  Three 
examples  are  given  which  represent  the  complete  range  of  character- 
istics of  the  filter  and  one  is  able  from  them  to  make  statements 
about  the  necessary  "memory"  of  the  filter.  One  is  also  able 
to  draw  conclusions  about  how  often  one  should  sample  a function 
based  upon  the  irregularity  of  the  function  and  on  the  noise 
that  contaminates  the  measurements. 


I.  INTRODUCTION 

As  a direct  application  of  the  general,  concepts  set  forth  in 
previous  reports1*?,  a problem  is  presented  which  not  only  illustrates 
many  of  the  important  general  points,  but  fortunately,  leads  to  an  exact 
analytical  solution.  Under  certain  circumstances  this  problem  may  have 
important  practical  applications,  but  for  the  present  it  will  be  treated 
as  a purely  hypothetical  case. 

We  will  describe  the  problem  in  the  statistical  sense  and  then 
present  three  particular  solutions.  As  a preparation  for  this  it  is 
necessary  to  describe  the  problem  in  terms  that  are  readily  interpreted 
so  as  to  define  the  two  functions  of  the  filter,  detection  and  selection. 

The  problem  to  be  treated  is  one  which  occurs  in  systems  that 
are  designed  to  track,  in  one  dimension,  on  the  basis  of  sampled  data. 

The  characteristics  of  the  track  and  of  the  noise  that  contaminate!  the 
data  are  given.  FTom  these  characteristics  we  are  able  to  compute  on 
the  basis  of  the  received  data  the  probability  distribution  functions  for 

1 M-1812, "The  Philosophy  of  Statistical  Filter  Design,"  W.I.  Wells,  Jan.  27,  1953. 

^ M-l886,"The  Specification  of  an  Ideal  Detector  as  a First  Step  in  Filter 
Design,"  W.I.  Wells,  March  6,  1953* 
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the  expected  position  and  velocity  of  the  track  after  the  reception  of  data* 
This  is  the  detection  problem. 

The  selection  problem  comes  in  choosing,  from  this  probability 
density  distribution  function,  the  value  of  position  and  velocity  that 
represents  the  "best"  guess  of  position  and  velocity  under  the  constraints 
imposed  by  the  desired  overall  function  of  the  filter.  This  action  of 
selection  will  be  discussed  first  since  its  action,  in  this  case,  is  such 
that  we  simplify  the  calculations  if  it  is  known* 

We  suppose  that  the  probability  distribution  function  has  been 
calculated.  We  now  desire  to  find  the  spot  at  which  we  next  expect  to  find 
the  track  when  the  next  data  arrives.  If  we  are  going  to  examine  the 
following  data  in  such  a way  that  only  data  that  is  extremely  close  to 
the  predicted  position  will  be  used,  we  will  be  obliged  to  look  at  the 
most  probable  value.  If,  however,  we  are  going  to  look  for  data  that  may 
be  anywhere  in  the  general  neighborhood,  we  may  choose  another  criterion. 

The  mathematical  statement  of  this  problem  is  that  we  wish  to  center  our 
area  of  search  on  the  probability  distribution  curve  in  such  a way  as  to 
cause  the  product  of  these  two  curves  to  include  the  most  area.  Under 
these  conditions  we  will  have  the  greatest  probability  of  "seeing"  the 
data  that  comes  In  next  time.  In  order  to  do  this  process  in  a mathematical 
way  we  note  that  this  is  just  the  convolution  integral  representation. 

Thus  we  construct  a function  which  represents  our  effective  area  of  search 
and  convolve  this  with  the  probability  distribution  function.  Then  the 
maximum  point  on  the  resulting  curve  is  the  correct  place  to  put  the  center 
of  area  of  search  in  order  to  maximize  the  probability  of  "seeing"  the  new 
data. 

We  note  that  if  the  probability  distribution  curve  is  symmetrical 
with  its  highest  point  as  the  center  and  that  the  curve  of  search  area  is 
symmetrical,  then  the  convolution  of  these  two  leads  to  the  fact  that  the 
highest  point  is  there  the  search  area  curve  is  centered  on  the  highest, 
center  point,  of  the  distribution  curve.  If  this  is  the  case,  one  may 
just  forget  about  this  convolution  operation  and  look  for  the  center  or 
maximum  of  the  probability  distribution  function.  This  is  the  case  in 
the  problem  being  reported  in  this  paper.  It  will  be  shewn  that  the 
probability  distribution  functions  come  out  as  normal  distributions,  and 
since  the  search  area  is  assumed  symmetrical,  we  need  only  ask  for  the 
highest  point  on  the  normal  curve  as  the  best  place  for  us  to  center  our 
search  area. 

The  problem  from  here  on  is  to  do  the  job  of  detection,  that  is, 
to  calculate  the  probability  distribution  curves.  It  will  be  understood 
that  the  process  of  selection  merely  picks  out  the  peak  of  the  distribution 
curve  so  we  will  give  formulae  that  do  this  as  part  of  the  following 
calculations.  Our  function  that  we  expect  to  filter  is  X(t)* 
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II.  THE  PROBLEM 

First  we  shall  discuss  the  characteristics  of  X(t).  We  choose 
to  say  that  the  second  derivative  of  X(t)  is  a constant  in  the  intervals 
between  samples.  This  is  an  arbitrary  representation  but  as  the  samples 
become  closer  together  the  representation  becomes  exact*  This  means  that 
the  curve  X(t)  is  to  be  approximated  with  a sequence  of  parabolas,  each 
connecting  two  adjacent  sample  times  in  such  a way  that  the  curve  of  X(t) 
is  smooth,  i.e.,  the  curve  is  continuous  at  the  sample  times,  with  the 
second  derivative  assuming  new  values  for  each  interval.  We  will  call  the 
first  derivative  the  velocity  and  the  second  derivative  the  acceleration. 
We  have  not  yet  completely  described  the  process.  For  ary  set  of  samples 
one  may  fill  in  several  smooth  curves  with  the  above  characteristics. 

See  Fig.  1. 


The  remaining  restriction  must  tell  which  curve  will  be  used  for  the 
approximation.  We  do  this  by  requiring  that  the  probability  distribution 
density  of  the  acceleration  be  a normal  distribution  centered  at  zero. 

That  is,  the  smaller  acceleration!  are  more  likely  than  the  larger  ones, 
hence,  the  approximating  curve  will  be  the  one  with  the  least  violent  turns . 
This  means  in  the  above  figure,  (Fig.  l),  the  continuous  curve,  (A),  is 
much  more  likely  than  the  dashed  curve,  (B).  We  write  this  distribution 
as 


W(a)  - 


where  (a)  is  the  acceleration, 


(1) 


is  the  variance  of  the  normal  distribution. 


Next  we  will  discuss  the  type  of  noise  to  be  considered.  When 
we  sampled  the  curve  we  make  a measurement  of  the  value  of  X at  a certain 
time.  If  this  measurement  is  exact,  there  is  no  noise.  We  wish  to  consider 
the  particular  case  where  there  is  some  doubt  as  to  the  exact  value  of  X. 

In  fact  we  assume'  that  the  exact  value  of  X is  normally  distributed  about 
the  measurement  value  of  the  staple.  Let  the  n'th  sample  be  called  S • 

Then 

- H>2 

where  O'  is  the  variance  of  the  normal  distribution. 
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III.  SOLUTION 


This  conpl.etes  the  characterization  of  the  problem.  Next  we  shall 
show  how  these  two  distributions  may  be  used  to  calculate  the  probability 
distribution  function  of  predicted  X and  V.  (Detection) 


We  begin  by  assuming  that  we  have  received  the  first  sample  (S_) 
at  t-1.  Nor  we  may  ask  for  the  probability  distribution  of  X.  This  is 
obviously 


\(1) 


1 

/2n~ <r  • 


(x-s^2 

Tip 


(3) 


Now,  there  is  a time  elapse  before  the  next  sample  and  two  effects 
become  apparent.  First,  if  there  is  a first  derivative,  or  velocity  (V), 
at  t-1,  then  we  can  say,  if  the  acceleration  (a)  is  zero,  that  at  t-2  the 
distribution  will  be  the  same  as  for  t-1  except  thab  X will  be  increased 
to  X+V.  We  measure  (V)  and  (a)  in  terms  of  the  sai^le  Interval  so  that 
V(t2-ti)-  V.  In  other  words,  if  a-0  then  at  t-2 


-(x-y-s^2 
*2™  " e ~TJP 


CM 


where  V is  the  velocity  at  t-1.  We  can  imagine  this  as  a distribution 
curve  which  is  a function  of  both  (X)  and  (V).  We  plot  this  on  an  (X,,V) 
plane  where  the  height  above  the  plane  is  the  height  of  the  probability 
density  distribution.  When  the  first  sample  was  received,  we  had  a 
plot  as  follows  1 


Fig.  2 


1 


\ 


I 


1 
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We  see  here  that  the  "mound"  is  uniform  in  the  7 coordinate.  This  means 
that  after  the  first  sanple  is  received,  one  knows  nothing  about  derivatives 
and  all  velocities  are  equilikely.  In  addition,  no  matter  where  we  cut 
through,  the  X distribution  is  normal  about  S^. 

Now,  if  a*0,  we  can  draw  the  distribution  at  t»2.  We  note  that 
if  the  initial  velocity  is  zero,  the  distribution  curve  is  the  same.  Thus, 
on  the  X axis,  where  7*0,  the  "mound"  remains  fixed.  If  the  velocity  is 
some  other  value,  the  mound  shifts  to  the  right  a distance  7.  One  sees 
that  the  whole  mound  Is  twisted  as  follows. 


It  is  seen  to  be  skewed  with  respect  to  the  7 axis.  This  means  that  if 
the  position  at  t»2  is  found  to  be  S the  most  likely  value  for  7 will  be 
found  at  7*S«-S-,  where  this  mound  has  its  highest  point,  which  intersects 
the  line  X - Sg. 

Now  we  examine  what  happens  to  this  distribution  if  the  acceleration 
is  not  zero.  Suppose  it  is  exactly  (a).  Then  we  know  that  in  the  time 
tg-t^,  X will  increase  to 

? «> 
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since  (a)  is  constant  In  the  interval.  We  ask  what  this  has  done  to  the 
’’mound"  that  we  started  with  in  Fig.  2.  Obviously  if  a»0,  we  get  Fig.  3» 
If  (a)  is  some  number,  the  mound  retains  its  same  cross-sectional  shape 
but  is  now  centered  at  X*S^*a/2  on  the  X axis  and  is  parallel  to  the  line 

27  - X^  (7) 

rather  than 

V - X-^  (8) 


as  in  Pig.  3* 

The  formal  procedure  for  accomplishing  transition  from  Pig.  3 

to  the  desired  curve  is  the  convolution.  When  the  acceleration  is  fixed 

at  a known  value  (a  ) its  distribution  function  is  an  impulse.  If  this 

0 

is  thought  of  as  a function  of  V,  we  have  UQ(V-ao).  Uo(X)  is  the  unit 

impulse  function.  It  is  zero  everywhere  except  at  X-o,  and  its  integral 
over  X is  one.  Convoling  this  with  a distribution  containing  V,  merely 
replaces  V by  V-a,  which  is  the  desired  transformation.  Likewise,  if 
the  impulse  is  written  UQ(2X-a),  and  convolved  with  a distribution  con- 
taining X,  we  replace  X by  X-a/2.  So  we  have  for  the  case  above 


W1(6-V,r)U0(  <r-X-  |)Uo(  6 -V-a)d  6 d 


r. 


(9) 


Both  steps  have  been  combined  here.  We  note  W^(  6”  -V,  T)  has  had  X replaced 

by  X-V  to  account  for  the  initial  velocity  at  t»l.  (This  is  the  transition 
from  Fig.  2 to  Fig.  3*)  The  convolution  shifts  the  curves  further  to  allow 
for  the  acceleration  during  the  interval. 

It  is  helnful  to  think  of  this  process  in  terms  of  filters. 

We  note  that  as  W(a)  is  an  impulse,  we  pass  the  function  W^(X-V,V)  through 

a "two-dimensional"  filter  whose  impulse  response  is  an  impulse,  delayed 
(a  ) units  in  the  V direction,  aQ  units  in  the  X direction.  Since  the 

0 r 

operations  of  integration  and  differentiation  involved  are  linear,  it 
is  proper  that  when  W(a)  is  different  from  an  impulse,  the  iraoulse  response 
of  the  filter  will  also  be  different.  In  fact  it  will  be  W(a),  also 
shifted  (a)  units  in  the  7 direction  and  a/2  units  in  the  X direction. 

The  needed  filter  has  the  impulse  response  related  to  W(a)  as  follows 

W/a)  . — > Wjv)vq[f)  (16) 

We  note  that  in  the  X,V  plane  these  convolutions  all  take  place  along  a 
straight  line  V-2X  - const.  Por  convenience  we  change  variables  from 


+ 


* 


61 

Memorandum  M-I898  Page  7 


1,7  to  W,Z  as  follows  . 


where  Z-const.  is  the  equation  of  V-21-const.  In  terms  of  W,Z  we  should 
expect  that  the  impulse  response  of  our  filter  contain  only  V.  The  equations 
for  the  transformed  coordinates  are: 


5 

z » 

V-2X 

5 

w - 

2V*X 

V - 

2>2W 

5 

X - 

-2Z4V 

* 

C. 


n 


(id 


We  note  that  the  filter  corresponding  to  a fixed  known  acceleration 

a 

U0(X-  ijr)Uo(V-ao)  becomes 


( -btg  -*o)  u (h2  - *„) 
° /T  0 /T 


This  has  a value  only  where  both  brackets  are  zero,  thus! 

- bZ*2W  - * m n m ZeW  - a„ 

7f  ’ 0 7T 

which  is  satisfied  only  for  Z ■ 0 and  the  impulse  response  becomes 


B»(7r  0) 


(12) 


(13) 


V 

■n 

•? 

■ i 
1 < 

i! 
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or  in  general 


if(a) 


w,c  T$ 


dk) 


Nov  ve  return  to  the  problem  laid  out.  The  distribution  of  acceleration 
was  to  be  normal. 


¥(a) 


|/2 Hot. 


Dropping  the  constant  multiplier,  ve  have  from  Eq.  lU> 

- 

2cC^ 


¥(¥) 

i 

e 

Let 

s - 

k 

10o(  2 

- ^¥2 

¥(¥) 

i 

e 

(IS) 


(16) 


(17) 


The  normal  distribution  of  time  position  about  observed  position  may  be 
written 


WC^Z) 


-p(¥-2Z 


where  p 


10  C 


h - | IT  sx 


(18) 

(19) 


Now  vith  the  notations  ve  ma 7 begin  the  process  of  constructing  the 
probability  density  curves  for  ¥(¥,Z)  after  receiving  several  pieces  of 
data. 


We  receive  the  first  sample  and  at  t»l  ve  hav# 


¥(¥,2,^) 


-P(¥-2Z-b1)2 

e 


(20) 


r 


stands  for  "is  proportional  to." 


I 


k 


■4 
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We  replace  X by  X-V  to  allow  for  Initial  velocities.  This  ie  done  in 
W,  Z by  replacing 

W by  1/$(3W-Z) 

Z by  1/5(7Z*UW) 

giving! 

-p(W*3Zeb. )2 

e A 

- 6 w2 

We  now  con voire  this  with  e 

W(W,Zat2)  A ‘ fa  0#f3Z*b1)2  (23) 


J 


which  gives 1 


(21) 


(22) 


( > 


This  is  the  distribution  Jpst  before  the  reception  of  the  second  sample. 
When  the  second  sanpie  is  received  we  have! 


W(W,Z,t?) 


" fa*-  (Wf3Z^>x)2 

© 0 


- 0(V-2Z-b2)2 


(2U) 


Again  we  change  variables  to  allow  for  initial  velocities  and  convolve 
with  e , getting 


w(w,z,t3)  •- 


♦ $6  (Wt3Zeb2)2 

♦ fer  w**V 


(2$) 


This  is  the  distribution  just  before  the  reception  of  the  third  sanpie. 
The  values  of(W  and  Z)(X,V)  which  maximise  this  egression  are  the  best 
guess  at  the  predicted  position  and  velocity  at  the  third  sanpie. 

From  the  length  of  this  expression  one  notes  that  the  algebra 
is  becoming  very  long.  Thus  we  shall  attempt  to  find  a recursion  f omuls 
to  be  used  in  numerical  examples.  We  note  that  by  further  manipulations 
each  of  the  previous  expressions  can  be  put  in  the  formi 

_ A1(W*BZ^1)2  - X2(Z*C2)2 
e 


> 

I "i 

! I 


14 


(26) 
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Since  the  expressions  retain  form  we  merely  choose  an  appropriate 
beginning  and  calculate  the  A,B,C  of  one  time  in  terms  of  those  at  one 
preceding  time. 

A convenient  spot  to  begin  is  where  we  have  just  received  a 
sample  and  thA changed  variables  to  allow  for  initial  velocities.  Then 
we  have 


, - *UA  ('WU*Cln-l>2  - 


where  this  gives  the  distribution  of  (V)  and  (XeV)  after  the  n-1  sample. 

-cTw2 

Now  we  convolve  this  with  to  get  the  distribution  of  the  predicted 

X and  V.  Then  we  receive  the  n*th  sample  and  change  variables  to  allow 
for  initial  velocities.  Putting  this  back  in  the  above  fora,  we  have 

^ * *8»-l  M * " (28) 

X[L  . ♦ pC2]e  pA-  . 

-A^  • 1 ^ (29) 

„ _ * *2„-l  ft  * » 


. c2  * ». 


f 


, 1 
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where 


X - 


*ln-l  & 


I • 


^n-l  ♦ 3 


Q - 


7Bn-l  -1 

5 


U - B , ♦ 2 
n-1 


(33) 


S ■ n'th  sample* 
n 


These  are  the  desired  recursion  relationships.  We  may  now  ask  for  the 
most  probable  (W,Z)  or  (X, V) . These  are  the  coordinates  of  the  peak  of 
the  "mound"  that  we  hare.  These  are  found  to  be. 


I 


I 


If 


1 

l—f 

ft 

> 

!*c 

. (3U) 

X<*1  • (V8)  -01” 

05) 

X . , ■ X eV  is  the  predicted  position  for  the  following 
n n (n+l)  sample. 

C.  C, 

X„  - (3-B.  ) * Jg. 

" \fT  fT 

(36) 

Somewhat  more  useful  formulae  are  obtained  when  one 
may  be  written  as  follows! 

notes  that  these  equations 

t 

V - V ♦ M(S  - X) 

n n-i  n n 

(37) 

X • X ♦ N(S  - X ) 

n n n n 

(38) 

where! 

M - 


(ZB^-l)  (XTUp  ♦ Vfl^P) 


t 


■'t 


\ , 


L >■' 

■Y 
V ■ 


(39) 
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N - 


(3-B^)  (XIUp  ♦ VSA^P) 


V1 


2n 


^ln 


(ho) 


The  size  of  the  distributions  are  of  interest,  especially  in 
the  X direction,  since  the  samples  are  received  in  X.  To  get  the  unconditional 
distribution  of  X,  we  merely  integrate  W(X,7)  over  all  V.  This  leaves 
a normal  distribution  in  X of  the  form: 


(hi) 


If  we  ask  for  the  distribution  of  X , we  have 

n* 


V3'Bl/  * *2n 

6 - 


and  for  have 


C 


» * V*  * ** 

10  fex  ^ . 


(U2) 


(U3) 


These  are  all  of  the  relations  that  are  needed  to  solve  any  particular 
problem*  The  particular  problems  are  described  by  giving  o(  and  . 

IV.  DISCUSSION 


First,  let  us  discuss  the  outstanding  characteristics  of  the 
solution  and  then  present  some  examples.  To  actually  construct  the 
optimum  filter  we  need  know  Eqs.  37  and  38,  and  the  appropriate  values 
of  M and  N for  the  particular  data.  This  filter  has  an  important  characteristic 
which  is  not  usually  so  evident.  M and  N are  functions  of  the  number  of 
pieces  of  data  received.  In  other  words,  when  one  first  begins  to  filter 
data,  the  filter  uses  one  set  of  values  of  M and  N,  then  as  more  and  more 
data  are  received,  these  values  of  M and  N change,  eventually  reaching 
a fixed  value  asymptotically.  In  many  cases  the  "transient"  behavior 
is  over  very  quickly  and  one  looses  very  little  by  using  the  fixed  filter 
straight  through.  On  the  other  hand,  there  are  cases  where  the  final 
form  of  the  filter  is  useless  unless  the  "transient"  is  used  first. 

The  most  striking  example  of  this  behavior  is  that  where  the  acceleration 
is  zero  at  all  times.  In  this  case  X(t)  is  a straight  line.  The  final 
values  of  M and  N are  zero.  This  occurs  only  alter  an  infinite  number 
of  samples  have  been  received.  Thus,  during  the  "transient"  the  filter 


4 
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determines  the  straight  line  and  then  after  a long  time  no  more  data  Is 
accepted.  In  this  case  the  "transient"  behavior  is  the  Important  part. 

One,  therefore,  sees  that  we  cannot  ask  for  the  final  values  of  M and  N 
and  automatically  expect  to  have  .a  good  filter. 

In  many  cases,  however,  the  transient  is  relatively  short  and 
new  data  is  weighted  rather  heavily  and  thus  only  small  errors  are  encountered 
by  using  a fixed  filter  with  the  final  values  of  M and  N.  Whether  this 
procedure  is  possible  depends  upon  the  relative  values  of  and  6 ' • 

In  any  case,  the  procedure  for  calculating  the  optimum  filter  is  given 
above • 


The  next  thing  to  be  considered  is  the  "goodness"  of  the  filter. 
Even  though  the  filter  may  be  "optimum"  it  may  not  do  enough  good  to 
warrant  its  use.  Therefore  we  ask,  how  much  sharpening  of  the  distribution 
curves  do  we  get  by  using  this  filter  with  several  pieces  of  data.  Since 
W(X)  is  a normal  curve,  its  variance  ft" 2 gives  its  sharpness*  Eqs.  1*2,U3 

give  ft  ^ for  -the  whole  fanily  of  filters  after  any  number  of  sables. 

Is  the  nuidber  of  samples  increase, ft  ^ decreases  asymptotically  to  a fixed 
value  which  represents  the  "best"  that  the  filter  can  do  with  this  particular 

a g 2 

type  of  datal  For  a good  filter  Q should  be  considerably  smaller  than  6~  . 

We  will  now  give  three  simple  examples.  The  first  leads  to  the 
straight  line  smoothing  equations. 

7.  EXAMPLE  1. 

Let  o/  • 0.  This  says  a -.0,  thus  X(t)  is  a straight  line. 

We  receive  our  first  sample  and  have 


T? 


which  after  a shift  due  to  velocity  and  change  of  variable  becomes  t 

4 


-pCwo*^)' 


From  Eq.  27.  *11  A2l"°  B1  " 3 


From  Eq.  16 


6 - 


m OO 


10 


From  Eqs.  33  X ■ p 

Eq.  28  L^2  - 10p 

Eq.  29  A22  - S/2P 

■an  n - i/o 


T 
T - 3 


C11  - bl 


Q - k U - 5 


C21  " 0 


Vy,  . 
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Eq.  39,1*0 


M ■ 1 N.  - 1 

2 2 


Eq.  1*2 


^ 1 - 5 tr 
3 


Now  repeating  these  steps  one  finds  in  general 


*n  ' ' n(n*l) 


Hn  ' (JUSl)  ) 


tere  n is  the  number  of 


V 2 _ i*n-2  _2 

n 

V 2 _ 2(2n*l)  2 


< 


r v 
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VI.  EXAMPLE  2. 

Let  <T  - 0,  which  says  there  is  no  noise,  hence  we  should  expect 
very  little  filtering. 

We  being  again  by  receiving  the  first  sample,  converting  to  (W,Z)  and  allowing 
for  initial  velocities!  this  gives  as  before t 


-*(*♦32^)  2 


All-?- 00  A2l"°  Bl-3  °ii  " bl  C21  m 0 


P 


■ 00 


io<r 


I T-3  Q - L 0-5 

Alj-P-oo  ^-2  5/-  Bj-3 

*2-1  Nj  - 1 

Bepeating. 

X - <T  T-3  Q - it  U-5 

- P •*>  A^  - 50<T  b3  - 3 

M3  - 3/2  N2  - 1 


^ -°  ars*. 3/8^ 2 

Again: 

All*-*5  A2L"75^  V3 

- 5/3  n2  - 1 

- 0 - 1/3^  2 
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or  in  general i 

N 

n 

- 1 

a 

l\) 

a 

o 

21 

ro 

« 

-•< 

We  see  that  the  filter  reaches  a nearly  fixed  value  after  about  6 pieces 
of  data.  In  this  case  we  conclude  that  the  *transient"  is  fairly  short 
and  reasonably  good  results  will  be  obtained  by  using  the  filter  M»2,  N*l, 
from  the  beginning.  Since  there  is  no  noise  in  the  data,  the  smoothed 
positions  are  obviously  not  in  error.  The  predicted  positions  are  such 

y 2 2 V 2 2 

that  there  is  an  inprovement  of  Q ■ .5  ©(  to  q ■ .2$o<  , for  the 
final  filter. 

Both  of  the  above  examples  could  have  been  worked  by  simpler 
means.  The  third  example  is  a more  realistic  problem  and  also  the  results 
do  not  come  out  so  nicely;  however,  by  continuation  of  the  above  processes 
we  can  work  this  example  for  as  many  steps  as  we  desire. 
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VII.  EXAMPLE  3. 

Let  p - £ - 1,  thus  $ 2 - 1/10  o{  2 " fc/10  • 

This  is  the  conjoined  case  of  noise  plus  non-straight  line  signals. 
Again  ire  find: 

*11  - P ’ 1 *21  "0  Bl"3  Cll’bl  °21  " 0 

X 1-3  Q - k U-5 

^12  " 11/2  A22  " if  ®2  " IT 

m2  - 1 k2  - 1 

if?-  1/10  • tf"2  2 - 7/10  - 7 ($"  2 

repeating  X • 11/13  T • Q • 

A13  - 72/13  A23  » B3  - 29/18 

M3  - 3A  N2  - 7/8 

#7  ■ 7/60  - 7/8  (T  ^ S'2 

again  < 

X - H I - 17/9  Q - ^ U - 63/18 

*11*  " F A2U  “ BU  " U7/29 

^ - li2/35  Iu  • U7/55 

- U7/550  - 1*7/55  <5"  2 ^ 2 - - 2$  S'* 
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We  see  for  this  filter,  the  transient  is  over  within  about  li  samples. 
Also,  we  see  no  great  Improvement  in  the  smoothed  data,  i.e.,  from 

1 to  -.85.  The  predicted  position  has  a distribution  roughly/5.91 
n 

or  2.1*  times  as  wide  as  the  received  data.  This  is  to  be  expected  since 

witho(  2 ■ U/10,  there  is  a fairly  large  possibility  of  variation  from 
a straight  line. 


v 

£ 


These  three  examples  illustrate  the  two  extreme  problems  and 
one  intermediate  one.  Any  other  problem  of  this  type  can  be  worked  out 
using  the  same  recursion  formulae. 

It  should  be  repeated  that  for  every  combination  of 
one  can  determine  M and  N (actually  these  depend  only  ono(/  6")  which 
completely  define  the  linear  filter;  then  one  also  may  calculate  the 

0 2«s  which  describe  the  quality  of  the  filter.  There  is  one  further 
criterion  which  could  be  easily  calculated.  This  is  the  variance  of 
the  unconditional  probability  distribution  of  velocity.  It  was  not 
included  since  it  is  of  secondary  importance,  when  the  data  involves 
X only. 
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Till.  CONCLUSION 

The  complete  solution  of  the  design  of  the  desired  filter  is 
now  given.  This  involves  the  two  processes,  detection  and  selection* 

In  this  particular  case  the  optimum  filter  turned  out  to  be  linear. 

It  is  conceivable  that  for  the  same  problem,  but  for  a different  search 
area  criterion,  one  would  find  a filter  that  is  not  linear.  The  detection 
or  calculation  of  the  distribution  curves  would  be  the  same,  only  the 
selection  problem  would  differ. 

For  other  problems,  in  which  the  distribution  curves  of  the 
acceleration  and  noise  are  not  normal,  one  would  in  theory  follow  the 
same  process  to  calculate  the  distribution  functions  of  predicted  position 
and  velocity.  Then  an  appropriate  criterion  would  be  used  in  the  selection 
process.  Since  the  calculation  of  the  distribution  curves  is  not  always 
possible  by  analytic  means  for  the  general  distribution,  it  is  hoped  that 
these  problems  may  be  solved  by  composing  the  actual  distributions  from 
sums  of  appropriately  chosen  normal  functions.  Then  the  distribution 
curves  of  the  predicted  quantities  will  be  combinations  of  sums  of  the 
type  of  solution  presented  here. 

There  are  certain  extensions  that  permit  further  constraints 
on  certain  of  the  variables,  such  as  requiring  the  acceleration  distribution 
to  be  a function  of  the  present  velocity,  etc.  These  concepts  are  very 
clear;  the  only  question  is  whether  solutions  of  these  more  general  forms 
can  be  done  analytically.  It  does  not  seem  economically  feasible  to  do 
these  calculations  with  some  special  analog  device,  although  it  is  perfectly 
possible  conceptually. 


Signed 


Approve 
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